Lambdamart implementation by srinikrish22 · Pull Request #52 · kiudee/cs-ranking

srinikrish22 · 2019-08-26T11:49:49Z

I have implemented the LambdaMART algorithm for object ranking.

Description

I have created a class with the required core functionality of lambdamart and added ndcg metric implementation to the utils. I have added the ranker into the test_ranking and also added the ranker definition in the constants.

Motivation and Context

This change aims at adding an implementation of a new ranker for object ranking to the repository.

How Has This Been Tested?

I have added the possibility to test this ranker under the existing testing infrastructure of the cs-ranking repository.

Does this close/impact existing issues?

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

My code follows the code style of this project.
My change requires a change to the documentation.
I have updated the documentation accordingly.
I have added tests to cover my changes.

… metric calculation related implementation. Also added the ranker to the tests

coveralls · 2019-08-26T14:19:12Z

Pull Request Test Coverage Report for Build 629

165 of 464 (35.56%) changed or added relevant lines in 44 files are covered.
1490 unchanged lines in 45 files lost coverage.
Overall coverage decreased (-22.2%) to 31.561%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
csrank/choicefunction/fate_choice.py	0	1	0.0%
csrank/choicefunction/fatelinear_choice.py	0	1	0.0%
csrank/choicefunction/fetalinear_choice.py	0	1	0.0%
csrank/choicefunction/generalized_linear_model.py	1	2	50.0%
csrank/choicefunction/pairwise_choice.py	1	2	50.0%
csrank/choicefunction/ranknet_choice.py	0	1	0.0%
csrank/dataset_reader/discretechoice/letor_listwise_discrete_choice_dataset_reader.py	1	2	50.0%
csrank/dataset_reader/labelranking/survey_dataset_reader.py	2	3	66.67%
csrank/dataset_reader/objectranking/depth_dataset_reader.py	0	1	0.0%
csrank/learner.py	0	1	0.0%

Files with Coverage Reduction	New Missed Lines	%
csrank/dataset_reader/expedia_dataset_reader.py	1	18.66%
csrank/dataset_reader/letor_listwise_dataset_reader.py	1	13.0%
csrank/dataset_reader/letor_ranking_dataset_reader.py	1	13.22%
csrank/dataset_reader/util.py	1	17.7%
csrank/dataset_reader/discretechoice/util.py	2	12.07%
csrank/objectranking/object_ranker.py	3	46.67%
csrank/discretechoice/discrete_choice.py	5	41.18%
csrank/numpy_util.py	6	57.89%
csrank/objectranking/fate_object_ranker.py	6	73.33%
csrank/layers.py	7	68.57%

Totals
Change from base Build 437:	-22.2%
Covered Lines:	2481
Relevant Lines:	7861

💛 - Coveralls

Keras 2.3 renamed `lr` to `learning_rate`. It is not entirely clear what convention `tf.keras` will follow (cf. keras-team/keras#13393), therefore we will deal with this when we transition to tf2 and tf.keras.

After changing the dependency to Tensorflow<2.0 the ranking tests were failing, due to small variations in loss values. To make the tests more robust, we require them to converge to 0 loss now and thus made the test learning problem much simpler to learn. This is an overview of the changes: * Pin Tensorflow to version 1.x * Improve convergence and speed of ranking tests - Reduce the number of instances of the learning problem - Require learners to reach exactly 0 loss.

Save State

Renamed the package to choice functions Added a class for standardizing the features Removed the bug in predicting the scores using pairs for feta discrete choice Reformatted the code

-Added the ignore folder in gitignore

- Added another baseline for Object Ranking problem

From http://olivier.chapelle.cc/pub/err.pdf.

We need to escape backslashes in latex code, since python otherwise interprets \c as an escaped unicode character. Fixes kiudee#59

Apparently tf.keras now uses[1] `learning_rate` instead of `lr` too, so we should switch. [1]keras-team/keras#13393

We do not have control about any of these warnings except the last one. And that one is harmless in a testing context. Maintaining a list of these filters is cumbersome. I wish there was some way to automatically ignore all warnings caused by third party code, but unfortunately I don't think there currently is. I've opened pytest-dev/pytest#6191 to discuss this. I still think it is worthwhile to filter these warnings, because otherwise they drown out those warnings that are actually relevant.

Out of our control. Fixes kiudee#74

… metric calculation related implementation. Also added the ranker to the tests

…o object ranking constants list

kiudee · 2020-01-15T14:48:06Z

Thank you for the new work on the implementation. Could you rebase your branch onto master and force push it here? Then it would be easier to review the changes.

prithagupta · 2020-01-15T15:14:28Z

I would agree with @kiudee.
@srinikrish22 Secondly, regarding the tests, I would suggest that you follow the pull request #80
run the test locally to just check the environment problem.
The current error is due coverage, which should not have any effect on your implementation work

… metric calculation related implementation. Also added the ranker to the tests

…o object ranking constants list

…rish22/cs-ranking into lambdamart-implementation

srinikrish22 · 2020-01-15T22:46:45Z

I have finished the rebase @kiudee. I had done it with my master but forgot to rebase my branch on my master. Now it is be done.
@prithagupta I tried to run the tests locally but my tensorflow installation is broken. I am currently trying to get the tests running on a colab instance.

prithagupta · 2020-01-17T09:47:34Z

@srinikrish22 Did you update the tensorflow, the 2.0 version should have a lot of problems. as far as i know, your ranker does not use tensorflow. So you should just fix your version of the last stable release of the tensorflow.

kiudee · 2020-01-20T08:57:35Z

@srinikrish22 I think you need to first pull the latest master before you do the rebase. Currently, there are still commits in this branch which do not belong to your changes (example 6c4c30c)

srinikrish22 · 2020-01-21T22:33:50Z

@kiudee I did rebase on the latest master branch recently. Those changes are because of the merge conflicts from the rebase. The last comment on the latest master is 229d5dd on the 19th of November. All my commits then continue after this change.

srinikrish22 · 2020-01-27T22:17:52Z

@kiudee I have created a new pull request to clarify the confusion about rebase with the current master. This should make things easier to compare the changes.

timokau · 2020-03-26T20:29:26Z

Closing this in favor of #82.

srinikrish22 added 2 commits August 26, 2019 13:33

Initial Commit. Added primary lambdamart class implementation and its…

678f523

… metric calculation related implementation. Also added the ranker to the tests

Added ranker to constants, made changes to tests and init

eaff70c

kiudee added the enhancement New feature or request label Aug 26, 2019

Fixed the custom decision tree params

8a54ccc

kiudee requested changes Aug 26, 2019

View reviewed changes

timokau and others added 24 commits October 9, 2019 17:46

Pin keras to a version <2.3

6c4c30c

Keras 2.3 renamed `lr` to `learning_rate`. It is not entirely clear what convention `tf.keras` will follow (cf. keras-team/keras#13393), therefore we will deal with this when we transition to tf2 and tf.keras.

Remove old implementations of label rankers

91b8a71

Parallelize travis build

fb92f9e

Reformated the code

7271c11

Save State

Issue kiudee#53

85ebe18

Renamed the package to choice functions Added a class for standardizing the features Removed the bug in predicting the scores using pairs for feta discrete choice Reformatted the code

Issue kiudee#51 Enhanced pymc3 tests

6ac62f9

-Added the ignore folder in gitignore

Fixed tests and speed up the tests

8130ddf

Added verbose for progressbar for choice functions

621f6ca

- Added another baseline for Object Ranking problem

Removed bugs in experiments

a3f2973

Removed a bug in choice functions

72daada

Implement the Expected Reciprocal Rank metric

7dcabc7

From http://olivier.chapelle.cc/pub/err.pdf.

Enable doctests in pytest

9e8a826

Add module documentation for metrics

b55272e

Document the ranking-ordering conversion

2388d89

Fix accidental escapes in strings (kiudee#68)

07a6faf

We need to escape backslashes in latex code, since python otherwise interprets \c as an escaped unicode character. Fixes kiudee#59

Fix "leaner" typo

5b980e2

Make the library compatible with a newer keras

ba03234

Apparently tf.keras now uses[1] `learning_rate` instead of `lr` too, so we should switch. [1]keras-team/keras#13393

Ignore numpy warning that is caused by theano

229d5dd

Out of our control. Fixes kiudee#74

Major Refactor. Rewritten the whole class. First version after refactor

6890f93

Some minor fixes and fixed the tunables parameter setting

74f2442

Initial Commit. Added primary lambdamart class implementation and its…

4390b44

… metric calculation related implementation. Also added the ranker to the tests

Added ranker to constants, made changes to tests and init

994b0a0

srinikrish22 added 5 commits January 14, 2020 00:11

Resolved error with init signature

f8c03b1

Made changes based on previous comments, minor fixes and add ranker t…

1c45ca3

…o object ranking constants list

Whitespace fixing

95476c8

Minor changes to fit function signature

9779b9e

Moved the query_lambdas function outside the class to fix Pool error

b0f9d3f

srinikrish22 added 16 commits January 15, 2020 23:07

Initial Commit. Added primary lambdamart class implementation and its…

28b7bc2

… metric calculation related implementation. Also added the ranker to the tests

Added ranker to constants, made changes to tests and init

420667f

Fixed the custom decision tree params

438ce52

Major Refactor. Rewritten the whole class. First version after refactor

1833431

Some minor fixes and fixed the tunables parameter setting

965a459

Major Refactor. Rewritten the whole class. First version after refactor

340c8c7

Some minor fixes and fixed the tunables parameter setting

9cf1b5b

Attempt to get rid of the Imputer error in Travis CI

e1fe742

Fixed indentation error in dcg metric

7e56a46

Fixed parameter errors in main class

255ac8e

Resolved error with init signature

5ddef22

Made changes based on previous comments, minor fixes and add ranker t…

067b5c9

…o object ranking constants list

Whitespace fixing

c9cd20f

Minor changes to fit function signature

26c9e3a

Moved the query_lambdas function outside the class to fix Pool error

113745c

Merge branch 'lambdamart-implementation' of https://github.com/srinik…

8379589

…rish22/cs-ranking into lambdamart-implementation

timokau closed this Mar 26, 2020

Conversation

srinikrish22 commented Aug 26, 2019

Description

Motivation and Context

How Has This Been Tested?

Does this close/impact existing issues?

Types of changes

Checklist:

Uh oh!

coveralls commented Aug 26, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Test Coverage Report for Build 629

💛 - Coveralls

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

kiudee commented Jan 15, 2020

Uh oh!

prithagupta commented Jan 15, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

srinikrish22 commented Jan 15, 2020

Uh oh!

prithagupta commented Jan 17, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kiudee commented Jan 20, 2020

Uh oh!

srinikrish22 commented Jan 21, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

srinikrish22 commented Jan 27, 2020

Uh oh!

timokau commented Mar 26, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

coveralls commented Aug 26, 2019 •

edited

Loading

prithagupta commented Jan 15, 2020 •

edited

Loading

prithagupta commented Jan 17, 2020 •

edited

Loading

srinikrish22 commented Jan 21, 2020 •

edited

Loading